Constructing a corpus-informed list of Arabic formulaic sequences (ArFSs) for language pedagogy and technology
Similar resources
A Corpus-driven Food Science and Technology Academic Word List
The overarching goal of this study was to create a list of the most frequently occurring academic words in Food Science and Technology (FST). To this end, a 4,652,444-word corpus called Food Science and Technology Research Articles (FSTRA), which included 1,421 research articles (RAs) randomly selected from 38 journals across five sub-disciplines in FST, was developed. Frequency and range-based...
The Potential of Corpus-informed L2 Pedagogy
Corpus linguistic methods have led to many revelations about the nature of language use and language learning which are otherwise untenable. To date, however, these findings have not had major impact on L2 (second and foreign language) teaching, except in a few areas like L2 for specialized purposes. This may be due to a number of reasons, including issues of theoretical commensurability and th...
Formulaic expressions in language technology
Little attention has been given to the treatment of formulaic expressions in language technology during the past few decades, although such expressions are in fact extremely common both in conversation and in written discourse. Computer-Assisted Translation (CAT) tools are currently the most widely used language technology tools among professional translators. In this paper, I intend to determin...
Linguistically Informed and Corpus Informed Morphological Analysis of Arabic
Standard English PoS-taggers generally involve tag-assignment (via dictionary-lookup etc.) followed by tag-disambiguation (via a context model, e.g. PoS-ngrams or Brill transformations). We want to PoS-tag our Arabic Corpus, but evaluation of existing PoS-taggers has highlighted shortcomings; in particular, about a quarter of all word tokens are not assigned a fully correct morphological analysis...
Constructing An Automatic Lexicon for Arabic Language
In this paper, we have designed and implemented a system for building an Automatic Lexicon for the Arabic language. Our Arabic Lexicon contains word-specific information. These pieces of information include: morphological information such as the root (stem) of the word, its pattern and its affixes, the part-of-speech tag of the word, which classifies it as a noun, verb or particle; lexical attr...
Journal
Journal title: International Journal of Corpus Linguistics
Year: 2019
ISSN: 1384-6655, 1569-9811
DOI: 10.1075/ijcl.16088.alg